NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Measuring and Controlling Instruction (In)Stability in Language Model Dialogs

Li, Kenneth; Liu, Tianle; Bashkansky, Naomi; Bau, David; Viégas, Fernanda; Pfister, Hanspeter; Wattenberg, Martin (October 2024, COLM)

System-prompting is a standard tool for customizing language-model chatbots, enabling them to follow a specific instruction. An implicit assumption in the use of system prompts is that they will be stable, so the chatbot will continue to generate text according to the stipulated instructions for the duration of a conversation. We propose a quantitative benchmark to test this assumption, evaluating instruction stability via self-chats between two instructed chatbots. Testing popular models like LLaMA2-chat-70B and GPT-3.5, we reveal a significant instruction drift within eight rounds of conversations. An empirical and theoretical analysis of this phenomenon suggests the transformer attention mechanism plays a role, due to attention decay over long exchanges. To combat attention decay and instruction drift, we propose a lightweight method called split-softmax, which compares favorably against two strong baselines.
more » « less
Full Text Available
ChainForge: A Visual Toolkit for Prompt Engineering and LLM Hypothesis Testing

https://doi.org/10.1145/3613904.3642016

Arawjo, Ian; Swoopes, Chelse; Vaithilingam, Priyan; Wattenberg, Martin; Glassman, Elena L (May 2024, ACM)

Full Text Available
Inference-Time Intervention: Eliciting Truthful Answers from a Language Model

Li, Kenneth; Patel, Oam; Viégas, Fernanda; Pfister, Hanspeter; Wattenberg, Martin (December 2023, NeurIPS)
Emergent World Representations: Exploring a Sequence Model Trained on a Synthetic Task

Li, Kenneth; Hopkins, Aspen K.; Bau, David; Viégas, Fernanda; Pfister, Hanspeter; Wattenberg, Martin (May 2023, ICLR)

Full Text Available
GAN Lab: Understanding Complex Deep Generative Models using Interactive Visual Experimentation

https://doi.org/10.1109/TVCG.2018.2864500

Kahng, Minsuk; Thorat, Nikhil; Chau, Duen Horng; Viegas, Fernanda B.; Wattenberg, Martin (January 2019, IEEE Transactions on Visualization and Computer Graphics)

Full Text Available

Search for: All records